AITopics | free text

free text

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Simulated Self-Assessment in Large Language Models: A Psychometric Approach to AI Self-Efficacy

Jackson, Daniel I, Jensen, Emma L, Hussain, Syed-Amad, Sezgin, Emre

arXiv.org Artificial Intelligence | Nov-27-2025

Self-assessment is a key aspect of reliable intelligence, yet evaluations of large language models (LLMs) focus mainly on task accuracy. We adapted the 10-item General Self-Efficacy Scale (GSES) to elicit simulated self-assessments from ten LLMs across four conditions: no task, computational reasoning, social reasoning, and summarization. GSES responses were highly stable across repeated administrations and randomized item orders. However, models showed significantly different self-efficacy levels across conditions, with aggregate scores lower than human norms. All models achieved perfect accuracy on computational and social questions, whereas summarization performance varied widely. Self-assessment did not reliably reflect ability: several low-scoring models performed accurately, while some high-scoring models produced weaker summaries. Follow-up confidence prompts yielded modest, mostly downward revisions, suggesting mild overestimation in first-pass assessments. Qualitative analysis showed that higher self-efficacy corresponded to more assertive, anthropomorphic reasoning styles, whereas lower scores reflected cautious, de-anthropomorphized explanations. Psychometric prompting provides structured insight into LLM communication behavior but not calibrated performance estimates.

large language model, machine learning, qwen3, (21 more...)

arXiv.org Artificial Intelligence

2511.19872

Country: North America > United States > Ohio (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing

Chen, Binger, Bök, Tacettin Emre, Rasti, Behnood, Markl, Volker, Demir, Begüm

arXiv.org Artificial Intelligence | Nov-24-2025

Foundation Models (FMs) are increasingly used in remote sensing (RS) for tasks such as environmental monitoring, disaster assessment, and land-use mapping. These models include unimodal vision encoders trained on a single data modality and multimodal architectures trained on combinations of SAR, multispectral, hyperspectral, and image-text data. They support diverse RS tasks including semantic segmentation, image classification, change detection, and visual question answering. However, selecting an appropriate remote sensing foundation model (RSFM) remains difficult due to scattered documentation, heterogeneous formats, and varied deployment constraints. We introduce the RSFM Database (RS-FMD), a structured resource covering over 150 RSFMs spanning multiple data modalities, resolutions, and learning paradigms. Built on RS-FMD, we present REMSA, the first LLM-based agent for automated RSFM selection from natural language queries. REMSA interprets user requirements, resolves missing constraints, ranks candidate models using in-context learning, and provides transparent justifications. We also propose a benchmark of 75 expert-verified RS query scenarios, producing 900 configurations under an expert-centered evaluation protocol. REMSA outperforms several baselines, including naive agents, dense retrieval, and unstructured RAG-based LLMs. It operates entirely on publicly available metadata and does not access private or sensitive data.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2511.17442

Country:

North America > United States (0.46)
North America > Canada (0.46)
Europe > Austria (0.28)

Genre:

Research Report (0.64)
Workflow (0.47)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

[Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI

Pielka, Maren, Schneider, Tobias, Terheyden, Jan, Sifa, Rafet

arXiv.org Artificial Intelligence | Nov-5-2024

We present an outline of the first large language model (LLM) based chatbot application in the context of patient-reported outcome measures (PROMs) for diabetic retinopathy. By utilizing the capabilities of current LLMs, we enable patients to provide feedback about their quality of life and treatment progress via an interactive application. The proposed framework offers significant advantages over the current approach, which encompasses only qualitative collection of survey data or a static survey with limited answer options. Using the PRObot LLM-PROM application, patients will be asked tailored questions about their individual challenges, and can give more detailed feedback on the progress of their treatment. Based on this input, we will use machine learning to infer conventional PROM scores, which can be used by clinicians to evaluate the treatment status. The goal of the application is to improve adherence to the healthcare system and treatments, and thus ultimately reduce cases of subsequent vision impairment. The approach needs to be further validated using a survey and a clinical study.

application, prom, protobot, (14 more...)

arXiv.org Artificial Intelligence

2411.02973

Country:

Europe > Germany > North Rhine-Westphalia > Cologne Region > Bonn (0.04)
North America > United States (0.04)

Genre:

Questionnaire & Opinion Survey (0.69)
Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.61)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Evaluating Large Language Models for Public Health Classification and Extraction Tasks

Harris, Joshua, Laurence, Timothy, Loman, Leo, Grayson, Fan, Nonnenmacher, Toby, Long, Harry, WalsGriffith, Loes, Douglas, Amy, Fountain, Holly, Georgiou, Stelios, Hardstaff, Jo, Hopkins, Kathryn, Chi, Y-Ling, Kuyumdzhieva, Galena, Larkin, Lesley, Collins, Samuel, Mohammed, Hamish, Finnie, Thomas, Hounsome, Luke, Riley, Steven

arXiv.org Artificial Intelligence | May-23-2024

Advances in Large Language Models (LLMs) have led to significant interest in their potential to support human experts across a range of domains, including public health. In this work we present automated evaluations of LLMs for public health tasks involving the classification and extraction of free text. We combine six externally annotated datasets with seven new internally annotated datasets to evaluate LLMs for processing text related to: health burden, epidemiological risk factors, and public health interventions. We initially evaluate five open-weight LLMs (7-70 billion parameters) across all tasks using zero-shot in-context learning. We find that Llama-3-70B-Instruct is the highest performing model, achieving the best results on 15/17 tasks (using micro-F1 scores). We see significant variation across tasks with all open-weight LLMs scoring below 60% micro-F1 on some challenging tasks, such as Contact Classification, while all LLMs achieve greater than 80% micro-F1 on others, such as GI Illness Classification. For a subset of 12 tasks, we also evaluate GPT-4 and find comparable results to Llama-3-70B-Instruct, which scores equally or outperforms GPT-4 on 6 of the 12 tasks. Overall, based on these initial results we find promising signs that LLMs may be useful tools for public health experts to extract information from a wide variety of free text sources, and support public health surveillance, research, and interventions.
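
The micro-F1 metric quoted in this abstract pools true positives, false positives, and false negatives over all documents and classes before computing precision and recall. A minimal Python sketch of that calculation follows; the label names and the toy predictions in the usage note are made up for illustration and are not data from the paper.

```python
def micro_f1(gold, pred):
    """Micro-averaged F1 for multi-label predictions.

    gold, pred: lists of label sets, one set per document.
    Counts are pooled across all documents before computing the score.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)   # labels predicted and correct
        fp += len(p - g)   # labels predicted but wrong
        fn += len(g - p)   # labels missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

For example, with gold labels `[{"gi"}, {"respiratory"}, {"gi", "respiratory"}]` and predictions `[{"gi"}, {"gi"}, {"gi"}]`, the pooled counts are 2 true positives, 1 false positive, and 2 false negatives, giving a micro-F1 of 4/7.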

evaluation, llm, public health, (13 more...)

arXiv.org Artificial Intelligence

2405.14766

Country:

North America > Canada (0.04)
South America > Colombia > Meta Department > Villavicencio (0.04)
South America > Brazil (0.04)
(9 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Public Health (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

QAnswer: Towards Question Answering Search over Websites

Guo, Kunpeng, Defretiere, Clement, Diefenbach, Dennis, Gravier, Christophe, Gourru, Antoine

arXiv.org Artificial Intelligence | Jan-17-2024

Question Answering (QA) is increasingly used by search engines to provide results to their end-users, yet very few websites currently use QA technologies for their search functionality. To illustrate the potential of QA technologies for the website search practitioner, we demonstrate web searches that combine QA over knowledge graphs and QA over free text, two approaches that are usually tackled separately. We also discuss the benefits and drawbacks of both approaches for website search. Our case studies are websites hosted by the Wikimedia Foundation (namely Wikipedia and Wikidata). Unlike a search engine (e.g. Google or Bing), we index the data integrally, i.e. we do not index only a subset, and exclusively, i.e. we index only data available on the corresponding website.

arxiv, paragraph, query, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3487553

2401.09175

Country:

Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom (0.04)
Europe > Italy (0.04)

Genre: Research Report (0.50)

Industry: Government (0.46)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.89)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.83)

SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

Liu, Shicheng, Xu, Jialiang, Tjangnaka, Wesley, Semnani, Sina J., Yu, Chen Jie, Dávid, Gui, Lam, Monica S.

arXiv.org Artificial Intelligence | Nov-16-2023

Many knowledge sources consist of both structured information, such as relational databases, and unstructured free text. Building a conversational interface to such data sources is challenging. This paper introduces SUQL, Structured and Unstructured Query Language, the first formal executable representation that naturally covers compositions of structured and unstructured data queries. Specifically, it augments SQL with several free-text primitives to form a precise, succinct, and expressive representation. This paper also presents a conversational search agent based on large language models, including a few-shot contextual semantic parser for SUQL. To validate our approach, we introduce a dataset consisting of crowdsourced questions and conversations about real restaurants. Over 51% of the questions in the dataset require both structured and unstructured data, suggesting that it is a common phenomenon. We show that our few-shot conversational agent based on SUQL finds an entity satisfying all user requirements 89.3% of the time, compared to just 65.0% for a strong and commonly used baseline.
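
To make the idea of augmenting SQL with a free-text primitive concrete, here is a minimal Python sketch built on SQLite. The `answer` function, the `restaurants` schema, and the keyword lookup are hypothetical stand-ins chosen for illustration; they are not SUQL's actual syntax or implementation, where a language model would presumably read the text instead of a keyword check.

```python
import sqlite3


def answer(text, question):
    """Hypothetical free-text primitive.

    A toy keyword check standing in for an LLM call: it looks for the
    last word of the question inside the free-text field.
    """
    keyword = question.lower().split()[-1].rstrip("?")
    return "yes" if keyword in text.lower() else "no"


def find_restaurants(rows, cuisine, question):
    """Filter on a structured column and a free-text column in one query."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE restaurants (name TEXT, cuisine TEXT, reviews TEXT)"
    )
    conn.executemany("INSERT INTO restaurants VALUES (?, ?, ?)", rows)
    # Expose the Python function to SQL so it can be used in WHERE clauses.
    conn.create_function("answer", 2, answer)
    cursor = conn.execute(
        "SELECT name FROM restaurants "
        "WHERE cuisine = ? AND answer(reviews, ?) = 'yes' "
        "ORDER BY name",
        (cuisine, question),
    )
    names = [row[0] for row in cursor]
    conn.close()
    return names
```

The design point is that the structured filter (`cuisine = ?`) and the unstructured one (`answer(reviews, ?)`) compose in a single executable query, rather than being handled by two separate systems.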

computational linguistic, proceedings, restaurant, (16 more...)

arXiv.org Artificial Intelligence

2311.09818

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Hong Kong (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(18 more...)

Genre: Research Report (0.82)

Industry: Consumer Products & Services > Restaurants (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

Transformer Based Geocoding

Solaz, Yuval, Shalumov, Vitaly

arXiv.org Artificial Intelligence | Jan-2-2023

In this paper, we formulate the problem of predicting a geolocation from free text as a sequence-to-sequence problem. Using this formulation, we obtain a geocoding model by training a T5 encoder-decoder transformer model using free text as an input and geolocation as an output. The geocoding model was trained on geo-tagged wikidump data with adaptive cell partitioning for the geolocation representation. All of the code, including a REST-based application, the dataset, and the model checkpoints used in this work, is publicly available.
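
One way to see how a geolocation can become the output of a sequence-to-sequence model is to encode a coordinate as a short sequence of cell tokens. The Python sketch below uses a simplified fixed-depth quadtree over latitude and longitude. This is an assumption made for illustration only: the abstract mentions adaptive cell partitioning but does not describe the scheme, and the function name and token format here are hypothetical.

```python
def cell_tokens(lat, lon, depth=4):
    """Encode a coordinate as a sequence of quadrant tokens.

    Each step halves the current bounding box in both directions and
    emits a token 0-3 naming the quadrant that contains the point.
    Fixed depth for simplicity; an adaptive scheme would instead vary
    the depth, for example with the density of training data.
    """
    south, north, west, east = -90.0, 90.0, -180.0, 180.0
    tokens = []
    for _ in range(depth):
        mid_lat = (south + north) / 2
        mid_lon = (west + east) / 2
        quadrant = 0
        if lat >= mid_lat:      # northern half
            quadrant += 2
            south = mid_lat
        else:                   # southern half
            north = mid_lat
        if lon >= mid_lon:      # eastern half
            quadrant += 1
            west = mid_lon
        else:                   # western half
            east = mid_lon
        tokens.append(str(quadrant))
    return tokens
```

Nearby points share a token prefix, so a decoder that gets the first few tokens right has already localized the text to a coarse region.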

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2301.0117

Country:

South America (0.04)
Pacific Ocean (0.04)
Europe > United Kingdom > Scotland > Highland (0.04)
(9 more...)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Sports > Olympic Games (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.64)

Machine Learning is the Wrong Way to Extract Data From Most Documents

#artificialintelligence | Jul-27-2022, 05:39:03 GMT

Documents have spent decades stubbornly guarding their contents against software. In the late 1960s, the first OCR (optical character recognition) techniques turned scanned documents into raw text. By indexing and searching the text from these digitized documents, software sped up formerly laborious legal discovery and research projects. Today, Google, Microsoft, and Amazon provide high-quality OCR as part of their cloud services offerings. But documents remain underused in software toolchains, and valuable data languish in trillions of PDFs.

document layout, representational mode, template, (12 more...)

#artificialintelligence

Country: North America > United States (0.05)

Industry: Banking & Finance > Insurance (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.55)

DataWords: Getting Contrarian with Text, Structured Data and Explanations

Gallant, Stephen I., Hossain, Mirza Nasir

arXiv.org Artificial Intelligence | Nov-9-2021

Our goal is to build classification models using a combination of free-text and structured data. To do this, we represent structured data by text sentences, DataWords, so that similar data items are mapped into the same sentence. This permits modeling a mixture of text and structured data by using only text-modeling algorithms. Several examples illustrate that it is possible to improve text classification performance by first running extraction tools (named entity recognition), then converting the output to DataWords, and adding the DataWords to the original text -- before model building and classification. This approach also allows us to produce explanations for inferences in terms of both free text and structured data.
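
The core idea, turning structured fields into text sentences so that similar items map to the same sentence, can be sketched in a few lines of Python. The bucketing rule (decades for numeric values) and the sentence template below are assumptions made for illustration; the abstract does not specify how DataWords are actually constructed.

```python
def to_datawords(record):
    """Turn structured fields into short text sentences.

    Numeric values are bucketed into ranges of ten so that similar
    values (e.g. 42 and 47) produce the same sentence. Other values
    are lower-cased. Both rules are illustrative assumptions.
    """
    sentences = []
    for field, value in sorted(record.items()):
        if isinstance(value, (int, float)):
            low = int(value // 10) * 10
            sentences.append(f"{field} between {low} and {low + 10}.")
        else:
            sentences.append(f"{field} is {str(value).lower()}.")
    return " ".join(sentences)


def augment(text, record):
    """Append the generated sentences to the original free text."""
    return f"{text} {to_datawords(record)}"
```

The augmented string can then be passed to any text-only classifier, which is what lets a single text-modeling algorithm handle the mixture of free text and structured data.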

dataword sentence, modeling, subset, (15 more...)

arXiv.org Artificial Intelligence

2111.05384

Country:

North America > United States > New York (0.04)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology > Medical Record (0.30)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Use Deep Learning to Write Like Shakespeare

#artificialintelligence | Aug-24-2021, 05:40:09 GMT

"Many a true word hath been spoken in jest." "O, beware, my lord, of jealousy; It is the green-ey'd monster, which doth mock The meat it feeds on." "There was a star danced, and under that was I born." Who can write like Shakespeare? Or even spell like Shakespeare?

king lear, neural network, shakespeare, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
